Smart Paper Notes

An Artificial Intelligence Dataset for Solar Energy Locations in India

Anthony Ortiz , Dhaval Negandhi , Sagar R Mysorekar , Joseph Kiesecker , Shivaprakash K Nagaraju , Caleb Robinson , Priyal Bhatia , Aditi Khurana , Jane Wang , Felipe Oviedo

Category: Machine Learning

2022-01-31

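Rapid development of renewable energy sources, particularly solar photovoltaics (PV), is critical to mitigating climate change. As a result, India has set ambitious goals to install 500 gigawatts of solar energy capacity by 2030. Given the large footprint projected to meet renewable energy targets, the potential for land use conflicts over environmental values is high. To expedite the development of solar energy, land use planners will need access to up-to-date and accurate geospatial information on PV infrastructure. In this work, we developed a spatially explicit machine learning model to map utility-scale solar projects across India using freely available satellite imagery, with a mean accuracy of 92%. Our model predictions were validated by human experts to obtain a dataset of 1363 solar PV farms. Using this dataset, we measured the solar footprint across India and quantified the degree of land cover modification associated with the development of PV infrastructure. Our analysis indicates that over 74% of solar development in India was built on land cover types that have natural ecosystem preservation or agricultural value.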

translated by Google Translate

Learning Representations that Enable Generalization in Assistive Tasks

Jerry Zhi-Yang He , Aditi Raghunathan , Daniel S. Brown , Zackory Erickson , Anca D. Dragan

Category: Machine Learning | Artificial Intelligence | Robotics

2022-12-05

Recent work in sim2real has successfully enabled robots to act in physical environments by training in simulation with a diverse "population" of environments (i.e. domain randomization). In this work, we focus on enabling generalization in assistive tasks: tasks in which the robot is acting to assist a user (e.g. helping someone with motor impairments with bathing or with scratching an itch). Such tasks are particularly interesting relative to prior sim2real successes because the environment now contains a human who is also acting. This complicates the problem because the diversity of human users (instead of merely physical environment parameters) is more difficult to capture in a population, thus increasing the likelihood of encountering out-of-distribution (OOD) human policies at test time. We advocate that generalization to such OOD policies benefits from (1) learning a good latent representation for human policies that test-time humans can accurately be mapped to, and (2) making that representation adaptable with test-time interaction data, instead of relying on it to perfectly capture the space of human policies based on the simulated population only. We study how to best learn such a representation by evaluating on purposefully constructed OOD test policies. We find that sim2real methods that encode environment (or population) parameters, and that work well in tasks robots do in isolation, do not work well in assistance. In assistance, it seems crucial to train the representation based on the history of interaction directly, because that is what the robot will have access to at test time. Further, training these representations to then predict human actions not only gives them better structure, but also enables them to be fine-tuned at test time, when the robot observes the partner act. https://adaptive-caregiver.github.io.


Where the Bee Sucks -- A Dynamic Bayesian Network Approach to Decision Support for Pollinator Abundance Strategies

Martine J. Barons , Aditi Shenvi

Category: Artificial Intelligence

2022-12-05

For policymakers wishing to make evidence-based decisions, one of the challenges is how to combine the relevant information and evidence in a coherent and defensible manner in order to formulate and evaluate candidate policies. Policymakers often need to rely on experts with disparate fields of expertise when making policy choices in complex, multi-faceted, dynamic environments such as those dealing with ecosystem services. The pressures affecting the survival and pollination capabilities of honey bees (Apis mellifera), wild bees and other pollinators are well documented, but the evidence is incomplete. In order to estimate the potential effectiveness of various candidate policies to support pollination services, there is an urgent need to quantify the effect of various combinations of variables on the pollination ecosystem service, utilising available information, models and expert judgement. In this paper, we present a new application of the integrating decision support system methodology for combining inputs from multiple panels of experts to evaluate policies to support an abundant pollinator population.


Finetune like you pretrain: Improved finetuning of zero-shot vision models

Sachin Goyal , Ananya Kumar , Sankalp Garg , Zico Kolter , Aditi Raghunathan

Category: Computer Vision | Machine Learning

2022-12-01

Finetuning image-text models such as CLIP achieves state-of-the-art accuracies on a variety of benchmarks. However, recent works like WiseFT (Wortsman et al., 2021) and LP-FT (Kumar et al., 2022) have shown that even subtle differences in the finetuning process can lead to surprisingly large differences in the final performance, both for in-distribution (ID) and out-of-distribution (OOD) data. In this work, we show that a natural and simple approach of mimicking contrastive pretraining consistently outperforms alternative finetuning approaches. Specifically, we cast downstream class labels as text prompts and continue optimizing the contrastive loss between image embeddings and class-descriptive prompt embeddings (contrastive finetuning). Our method consistently outperforms baselines across 7 distribution shifts, 6 transfer learning, and 3 few-shot learning benchmarks. On WILDS-iWILDCam, our proposed approach FLYP outperforms the top of the leaderboard by $2.3\%$ ID and $2.7\%$ OOD, giving the highest reported accuracy. Averaged across 7 OOD datasets (2 WILDS and 5 ImageNet associated shifts), FLYP gives gains of $4.2\%$ OOD over standard finetuning and outperforms the current state of the art (LP-FT) by more than $1\%$ both ID and OOD. Similarly, on 3 few-shot learning benchmarks, our approach gives gains up to $4.6\%$ over standard finetuning and $4.4\%$ over the state of the art. In total, these benchmarks establish contrastive finetuning as a simple, intuitive, and state-of-the-art approach for supervised finetuning of image-text models like CLIP. Code is available at https://github.com/locuslab/FLYP.
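The contrastive finetuning idea described above (class labels cast as text prompts, then the image-text contrastive loss optimized as in pretraining) can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the prompt template, the temperature value, and the function names `label_to_prompt` and `contrastive_loss` are assumptions chosen for illustration, and the embeddings are stand-ins for what the image and text encoders would produce.

```python
import math


def label_to_prompt(class_name):
    # Hypothetical prompt template; the template actually used is not given in the abstract.
    return f"a photo of a {class_name}"


def l2_normalize(vec):
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def log_softmax(row):
    # Numerically stable log-softmax over one row of logits.
    peak = max(row)
    lse = peak + math.log(sum(math.exp(x - peak) for x in row))
    return [x - lse for x in row]


def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric cross-entropy over cosine similarities.

    Assumes entry i of image_embs is paired with entry i of text_embs
    (the embedding of that image's class prompt).
    """
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    n = len(imgs)
    # logits[i][j] = scaled cosine similarity between image i and prompt j.
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in txts]
        for img in imgs
    ]
    # Image-to-text direction: each image should pick out its own prompt.
    img_to_txt = -sum(log_softmax(logits[k])[k] for k in range(n)) / n
    # Text-to-image direction: each prompt should pick out its own image.
    columns = [[logits[r][c] for r in range(n)] for c in range(n)]
    txt_to_img = -sum(log_softmax(columns[k])[k] for k in range(n)) / n
    return 0.5 * (img_to_txt + txt_to_img)


# Toy embeddings standing in for encoder outputs.
images = [[1.0, 0.0], [0.0, 1.0]]
matched_prompts = [[1.0, 0.0], [0.0, 1.0]]
swapped_prompts = [[0.0, 1.0], [1.0, 0.0]]
aligned = contrastive_loss(images, matched_prompts)
misaligned = contrastive_loss(images, swapped_prompts)
```

In this toy setup the loss is small when each image embedding lines up with its own prompt embedding and large when the pairs are swapped, which is the signal that finetuning would follow when updating both encoders.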


DeepG2P: Fusing Multi-Modal Data to Improve Crop Production

Swati Sharma , Aditi Partap , Maria Angels de Luis Balaguer , Sara Malvar , Ranveer Chandra

Category: Machine Learning

2022-11-11

Agriculture is at the heart of the solution to achieve sustainability in feeding the world population, but advancing our understanding of how agricultural output responds to climatic variability is still needed. Precision Agriculture (PA), which is a management strategy that uses technology such as remote sensing, Geographical Information System (GIS), and machine learning for decision making in the field, has emerged as a promising approach to enhance crop production, increase yield, and reduce water and nutrient losses and environmental impacts. In this context, multiple models to predict agricultural phenotypes, such as crop yield, from genomics (G), environment (E), weather and soil, and field management practices (M) have been developed. These models have traditionally been based on mechanistic or statistical approaches. However, AI approaches are intrinsically well-suited to model complex interactions and have more recently been developed, outperforming classical methods. Here, we present a Natural Language Processing (NLP)-based neural network architecture to process the G, E and M inputs and their interactions. We show that by modeling DNA as natural language, our approach performs better than previous approaches when tested for new environments and similarly to other approaches for unseen seed varieties.


Beyond Conjugacy for Chain Event Graph Model Selection

Aditi Shenvi , Silvia Liverani

Category: (Statistics) Machine Learning

2022-11-07

Chain event graphs are a family of probabilistic graphical models that generalise Bayesian networks and have been successfully applied to a wide range of domains. Unlike Bayesian networks, these models can encode context-specific conditional independencies as well as asymmetric developments within the evolution of a process. More recently, new model classes belonging to the chain event graph family have been developed for modelling time-to-event data to study the temporal dynamics of a process. However, existing model selection algorithms for chain event graphs and their variants rely on all parameters having conjugate priors. This is unrealistic for many real-world applications. In this paper, we propose a mixture modelling approach to model selection in chain event graphs that does not rely on conjugacy. We also show that this methodology is more amenable to being robustly scaled than the existing model selection algorithms used for this family. We demonstrate our techniques on simulated datasets.


BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video

Ali Athar , Jonathon Luiten , Paul Voigtlaender , Tarasha Khurana , Achal Dave , Bastian Leibe , Deva Ramanan

Category: Computer Vision

2022-09-25

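Multiple existing benchmarks involve tracking and segmenting objects in video, e.g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them because they use disparate benchmark datasets and metrics (e.g. J&F, mAP, sMOTSA). As a result, published works usually target a particular benchmark and are not easily comparable to one another. We believe that the development of generalized methods that can tackle multiple tasks requires greater cohesion among these research sub-communities. In this paper, we aim to facilitate this by proposing BURST, a dataset that contains thousands of videos with high-quality object masks, and an associated benchmark with six tasks involving object tracking and segmentation in video. All tasks are evaluated using the same data and comparable metrics, which enables researchers to consider them in unison and hence pool knowledge more effectively from different methods across different tasks. Additionally, we demonstrate several baselines for all tasks and show that approaches from one task can be applied to another with a quantifiable and explainable performance difference. Dataset annotations and evaluation code are available at: https://github.com/ali2500/burst-benchmark.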


Will It Blend? Mixing Training Paradigms & Prompting for Argument Quality Prediction

Michiel van der Meer , Myrthe Reuver , Urja Khurana , Lea Krause , Selene Báez Santamaría

Category: Natural Language Processing | Artificial Intelligence

2022-09-19

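This paper describes our contributions to the Shared Task of the 9th Workshop on Argument Mining (2022). Our approach uses Large Language Models for the task of Argument Quality Prediction. We perform prompt engineering using GPT-3 and also investigate the training paradigms of multi-task learning, contrastive learning, and intermediate-task training. We find that a mixed prediction setup outperforms single models. Prompting GPT-3 works best for predicting argument validity, while argument novelty is best estimated by a model trained using all three training paradigms.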


Can GAN-induced Attribute Manipulations Impact Face Recognition?

Sudipta Banerjee , Aditi Aggarwal , Arun Ross

Category: Computer Vision

2022-09-07

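The impact of demographic factors such as age, sex, race, etc. has been studied extensively in automated face recognition systems. However, the impact of digitally modified demographic and facial attributes on face recognition is relatively under-explored. In this work, we study the effect of attribute manipulations induced via generative adversarial networks (GANs) on face recognition performance. We conduct experiments on the CelebA dataset by intentionally modifying thirteen attributes using AttGAN and STGAN and evaluating their impact on two deep learning-based face verification methods, ArcFace and VGGFace. Our findings indicate that some attribute manipulations involving eyeglasses and digital alteration of sex cues can impair face recognition by up to 73% and need further analysis.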


ATP: A holistic attention integrated approach to enhance ABSA

Ashish Kumar , Vasundhra Dahiya , Aditi Sharan

Category: Natural Language Processing

2022-08-04

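Aspect-based sentiment analysis (ABSA) deals with identifying the sentiment polarity of a review sentence towards a given aspect. Deep learning sequential models such as RNN, LSTM, and GRU are the current state-of-the-art methods for inferring sentiment polarity. These methods capture the contextual relationships between the words of a review sentence well. However, they are weak at capturing long-term dependencies. The attention mechanism plays a significant role by focusing only on the most crucial parts of the sentence. In the case of ABSA, aspect position plays a vital role: words near the aspect contribute more when determining the sentiment towards that aspect. Therefore, we propose a method that captures position-based information using the dependency parsing tree and uses it to assist the attention mechanism. Using this type of position information instead of a simple word-distance-based position enhances the deep learning model's performance. We performed experiments on the SemEval'14 dataset to demonstrate the effect of dependency parsing relation-based attention for ABSA.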
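To make the contrast between dependency-tree position and plain word distance concrete, here is a small sketch. It is not the paper's method: the head-index encoding of the parse, the function names `tree_distances` and `position_weights`, and the `1 / (1 + d)` weighting are all assumptions made for illustration, and the toy parse is written by hand rather than produced by a parser.

```python
from collections import deque


def tree_distances(heads, aspect_index):
    """Number of dependency edges between the aspect token and every token.

    heads[i] is the index of token i's syntactic head, or -1 for the root.
    """
    n = len(heads)
    neighbors = [[] for _ in range(n)]
    for child, head in enumerate(heads):
        if head >= 0:
            # Treat dependency edges as undirected for distance purposes.
            neighbors[child].append(head)
            neighbors[head].append(child)
    dist = [None] * n
    dist[aspect_index] = 0
    queue = deque([aspect_index])
    while queue:  # breadth-first search over the parse tree
        node = queue.popleft()
        for nxt in neighbors[node]:
            if dist[nxt] is None:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def position_weights(distances):
    # Hypothetical weighting: tokens closer to the aspect get larger weights.
    return [1.0 / (1 + d) for d in distances]


# Hand-written toy parse of "The pizza was delicious" with aspect "pizza".
tokens = ["The", "pizza", "was", "delicious"]
heads = [1, 3, 3, -1]  # The -> pizza, pizza -> delicious, was -> delicious, root
aspect = 1
syntactic = tree_distances(heads, aspect)
linear = [abs(i - aspect) for i in range(len(tokens))]
weights = position_weights(syntactic)
```

In this toy sentence the opinion word "delicious" is two tokens away from "pizza" in the word sequence but only one edge away in the parse tree, so a tree-based position signal would give it more weight than the copula "was". Such weights could then be used to scale attention scores.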
